Excape WP1. Conformal Predictors
نویسندگان
چکیده
The report summarises some preliminary findings of WP1.4: Confidence Estimation and feature significance. It presents an application of conformal predictors in transductive and inductive modes to the large, high-dimensional, sparse and imbalanced data sets found in Compound Activity Prediction from PubChem public repository. The report describes a version of conformal predictors called Mondrian Predictor that keeps validity guarantees for each class. The experiments were conducted using several non-conformity measures extracted from underlying algorithms such as SVM, Nearest Neighbours and Näıve Bayes. The results show (1) that Inductive Conformal Mondrian Prediction framework is quick and effective for large imbalanced data and (2) that its less strict i.i.d. requirements combine well with training set editing algorithms such as Cascade SVM. Among the algorithms tested with the Mondrian ICP framework, Cascade SVM with Tanimoto+RBF kernel appeared to be best performing one, if the quality criteria are precision, recall and number of uncertain predictions. The report also describes briefly the parallelization approach that allowed to distribute the computational load and reduce execution time.
منابع مشابه
On the Calibration of Aggregated Conformal Predictors
Conformal prediction is a learning framework that produces models that associate with each of their predictions a measure of statistically valid confidence. These models are typically constructed on top of traditional machine learning algorithms. An important result of conformal prediction theory is that the models produced are provably valid under relatively weak assumptions—in particular, the...
متن کاملConformal Prediction for Reliable Machine Learning: Theory, Adaptations, and Applications
An appealing property of conformal predictors is their automatic validity under the exchangeability assumption: they make an error with probability not exceeding the prespecified significance level. A major focus of this chapter will be on conditional versions of the notion of validity. This requirement will be introduced in Section 2.1 and studied further in Sections 2.2, 2.4, 2.6, and 2.7. Ot...
متن کاملApplication of Conformal Predictors to Tea Classification Based on Electronic Nose
In this paper, we present an investigation into the performance of conformal predictors for discriminating the aroma of different types of tea using an electronic nose system based on gas sensors. We propose a new non-conformity measure for the implementation of conformal predictors based on Support Vector Machine for multi-class classification problems. The experimental results have shown the ...
متن کاملConformal Predictors for Compound Activity Prediction
The paper presents an application of Conformal Predictors to a chemoinformatics problem of identifying activities of chemical compounds. The paper addresses some specific challenges of this domain: a large number of compounds (training examples), high-dimensionality of feature space, sparseness and a strong class imbalance. A variant of conformal predictors called Inductive Mondrian Conformal P...
متن کاملIntroduction to Conformal Predictors Based on Fuzzy Logic Classifiers
In this paper, an introduction to the main steps required to develop conformal predictors based on fuzzy logic classifiers is provided. The more delicate aspect is the definition of an appropriate nonconformity score, which has to be based on the membership function to preserve the specificities of Fuzzy Logic. Various examples are introduced, to describe the main properties of fuzzy logic base...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2015